INTRODUCTION

QUESTIONS

  1. What are the leading causes of death since 2001?
  2. Which state had the most amount of deaths in the US?
  3. What was the leading cause of death in California?
  4. How has the cause of heart disease deaths changed over time?
  5. How has the cause of cancer deaths changed over time?
  6. We know the adjusted death rate for the data, but what does the rate look like when we subtract “all causes’?
  7. What is the death rate by cause and state?
  8. What is the age-adjusted death rate for heart disease?
  9. Is Florida mainly older people?
  10. What is the comparison between deaths and age adjusted death rate?

Variables in the dataset:

Leading Causes of Death Since 2001

This plot shows the total summation of all Deaths from 1999 to 2017. We can see that there are many deaths due to Heart Disease and Cancer. The least amount of deaths being by suicide. Because Heart Disease and Cancer were the top causes of death across all 50 states, it led us to our next questions: Which state experienced the most deaths? And, are Heart Disease and Cancer also the two leading the causes in that state?

Total Deaths in Each State

This plot illustrates the total deaths throughout every state from 1999 to 2017. California had the most amount of deaths, though this can be due to the high population in California. Then we see that Florida and Texas follow up behind California. Even though Florida has a smaller population than Texas, the amount of deaths overpassed Texas’ deaths.

Each Disease Percentages in Every State

This heatmap shows the percentage that each disease accounts for in each state. This percentage is calculated by dividing the number of deaths from a specific cause by the total number of deaths in each state. We can see that in some states there is a higher percentage of people dying from a certain disease than others. For example, we see a higher percentage in unintentional injuries in Alaska than any other state. Similarly, Heart Disease accounts for a large share of deaths in New York.

Leading Cause of Deaths in California

Since California had the highest amount of deaths in the United States, we chose to take a closer look. As shown, Heart Disease and Cancer have consistently been the leading causes of death in each state.

Changes Throughout the Years

## `summarise()` has grouped output by 'Cause.Name'. You can override using the
## `.groups` argument.

We decided to create a comprehensive graph that contains all causes of death over time to be able to identify which have changed the most. Heart disease started off with a high amount of deaths and has slowly decreased throughout time. In contrast, Cancer shows a steady, linear increase in deaths through the years. It is imperative to note that though the number of deaths caused by Heart Disease has slowly decreased over time, it continues to be a top leading cause of death. For the other diseases, we see a constant amount of deaths, some decreasing like stroke, and some increasing like unintentional injuries and Alzheimer’s disease.

How has the number of heart disease deaths changed over time?

Year Total Deaths
1999 1,450,384
2000 1,421,520
2001 1,400,284
2002 1,393,894
2003 1,370,178
2004 1,304,972
2005 1,304,182
2006 1,263,272
2007 1,232,134
2008 1,233,656
2009 1,198,826
2010 1,195,378
2011 1,193,154
2012 1,199,422
2013 1,222,210
2014 1,228,696
2015 1,267,684
2016 1,270,520
2017 1,294,914

This plot shows a steady decline in deaths caused by Heart Disease throughout time. Because of increased public awareness and access to preventive care, it is evident that individuals have began seeking help and leading a more healthier lifestyle. Notably, between the years of 2010 and 2012, there is a visible dip in deaths, which aligns with the launch of the Million Hearts initiative by the United States Department of Health and Human Services. Their main goal was to prevent 1 million heart attacks and strokes by 2017. The earlier decline, prior to 2010, can be attributed to the improved public health messaging, reduced smoking rates, and healthier diet habits.

How has the number of cancer deaths changed over time?

Year Total Deaths
1999 1,099,676
2000 1,106,182
2001 1,107,536
2002 1,114,542
2003 1,113,804
2004 1,107,776
2005 1,118,624
2006 1,119,776
2007 1,125,750
2008 1,130,938
2009 1,135,256
2010 1,149,486
2011 1,153,382
2012 1,165,246
2013 1,169,762
2014 1,183,400
2015 1,191,860
2016 1,196,076
2017 1,198,216

This data shows a consistent increase caused by Cancer throughout the years. However, there is a dip in 2004. This is due to some states statistical data not meeting the requirements to be included to the US data. Since this happened, the amount of deaths reported appeared lower due to some states not meeting the requirements to input their data for that year.

We know the adjusted death rate for the data, but what does the rate look like when we subtract “all causes’?

State Year Total Deaths Known Cause Deaths Unclassified Deaths
Alabama 2017 53,238 39,366 13,872
Alaska 2017 4,411 3,118 1,293
Arizona 2017 57,758 42,928 14,830
Arkansas 2017 32,588 25,233 7,355
California 2017 268,189 206,761 61,428
Colorado 2017 38,063 27,626 10,437
Connecticut 2017 31,312 22,103 9,209
Delaware 2017 9,178 6,902 2,276
District of Columbia 2017 4,965 3,581 1,384
Florida 2017 203,636 152,459 51,177

This plot contains a side-by-side comparison of each leading cause of deaths since 2001. The plot on the left displays the total number of deaths attributed to the top 10 leading causes, while the plot on the right shows the number of unexplained/unclassified deaths labeled as “all causes” in our data set. The unexplained deaths likely represent causes not included based on the top 10 and therefore not specifically categorized.The total amount of deaths is from all residents death certificates that were filed through this time. Notably, the number of deaths in the “all causes” category has steadily increased over time.

What is age adjusted death rate?

  • Different states or regions have different age structures.

  • Older populations naturally have higher death rates so comparing raw death rates across states would be misleading.

  • To make fair comparisons across the states, public health stats use age adjustment instead of uing raw numbers.

What is the death rate by cause and state?

This plot represents the average age-adjusted death rates of these diseases and their causes. The averages aid in providing an accurate presentation of what each of these diseases consist of. For some diseases, such as diabetes, Alzheimer’s, and CLDR, the rates appear consistent across states. In contrast, the death rates for Heart Disease and Cancer are more widely scattered which would indicate a significant variation between states. Some of these differences can the attributed to some factors like population age, access to healthcare, and lifestyle.

This plot represents the average death rate for Heart Disease across each state in the United States. Notably, Mississippi has consistently had the highest death rate throughout the years. Mississippi has implemented some changes for the community such as The Mississippi Chronic Illness Coalition (MCIC) to help improve the amount of deaths caused by Heart disease. Its main focus is on community education, healthcare provider training, and the fight to improve cardiovascular health. This link provides more information on their plans to prevent heart disease: MSDH Heart Disease Prevention Plan.

How Does Total Amount Of Deaths Reflect Onto Age Adjusted Death Rate?

## Coordinate system already present. Adding new coordinate system, which will
## replace the existing one.

When looking at all these graphs throughout the time, we can see the amount of deaths and the Age Adjusted Death Rate together. We see that through many of the death causes like CLRD, Stroke, Diabetes, Influeza and Pneumonia there has been a decrease throughout time. We see a huge decrease in the cause stroke also. Strokes can be caused if you have diabetes, it is an underlining of strokes because it can damage your blood vessels and cause your blood to cloth. We can see that diabetes has also decrease. Therefore these two are somewhat reflecting off of each other to an extent.

A Closer Look Into Heart Disease and Cancer

When we look at this data, we can notice that there has been a change in Heart Disease and Cancer throughout these years. When I searched it up a bit more, it says that there can be changes throughout the years due to better technology. Due to better technology, these diseases are faster to detect. Therefore the age gap between the amount of deaths is closing in. Creating a greater percentage for those who have an older population. While deaths may be high, this can be due to the population increasing over the time.

A Closer Look Into Florida

## Coordinate system already present. Adding new coordinate system, which will
## replace the existing one.

While presenting we were asked about Florida specifically, mainly about the age in Florida. We were not given ages, but we can look at the Age Adjusted Death Rate and see that the percentages have increased. Alzheimer’s disease is mainly diagnosed on people of older age. (Ages 65+) This disease has had an increase in Florida, so yes we can say that older people do live in Florida. Before though, not many people. The Age Adjusted Death Rate didn’t see in increase until 2011, therefore now there are probably more older people moving to Florida.

LIMITATION

This project has been very interesting and insightful experience to work on, though there are some limitations. I think it would’ve been better if our data also included ages. We think this would’ve helped a lot in some of the questions that we were receiving from people. Ages play a big roll into these diseases, we are given a bit by the Age Adjusted Death Rate but not to the fullest (Specifics on age range). I also think that it would’ve been nice to have the specific type of Cancer. We are given a more generalized name for Cancer, it is all in one category instead of being a general Cancer name and a specific Cancer name. Another limitation that led to less specificity in our data would be the “all causes” category. It was unknown what it consisted of and it would have bene helpful to have a clearer breakdown beyond just the 10 causes.

SUMMARY

Through this project, we were able to gain an insight into the impact of each of these diseases and how they contribute to the mortality across every state. Regardless of the state, there was a high amount of heart disease and cancer deaths as the top two leading causes of death. Fortunately, there have been countless initiatives on a national and state level that have been implemented to prevent these diseases, or at the very least slow the rate in which they occur with early identification. We hope that this project encourages people to take action on managing their health, as many of these diseases can now be detected at a earlier state, thanks to advancements in technology.